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Translator's Preface 


Although probability theory owes its early development to interest in games 
of chance, mathematicians as well as philosophers and scientists have be¬ 
come increasingly concerned with questions concerning the applicability of 
the theory to an ever-widening range of natural and social phenomena. This 
has raised questions about the meaning of "’probability, questions about 
the interpretation of probability statements, and questions about the justili- 
abilityof the assumptions underlying various probabilistic models. Although 
there has been a recent upsurge in the teaching of probability on the 
secondary school and elementary college level, most of the attention has 
been devoted to the purely mathematical aspects of the theory (with 
special emphasis on the algebra of sets) and to straightforward applications 
to statistics. As a result, recent textbooks written essentially for the layman 
have ignored many of the fascinating problems related to probability 
theory which are of a philosophical rather than of a purely mathematical 
nature. 

Even though the reader may disagree with some of the views expressed 
by the author—the translator does—he should find that the controversial 
aspects of probability shed an entirely new and fascinating light on the 
theory. It is of special interest to observe that in the author's opinion the 
material in this book (including questions of objective vs. subjective 
probabilities, questions of basing “rational” decisions on probabilities, ques¬ 
tions concerning the appropriateness of equiprobability and other assump¬ 
tions, questions concerning the probabilities of causes, and questions 
concerning other controversial matters) should be part of everyone's 
secondary education. 

The problems in this book are numbered as in the French edition, with a 
few numbers missing. The only changes made in this translation were the 
substitution of some games of chance better known in the English-speaking 
world than the ones used by the author, and the omission of some footnote 
references to French works not readily available to most readers. 


JOHN E. FREUND 
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CHAP. 1 


4K>\e table by interchanging the middle outcomes (and thus bringing to¬ 
gether ail combinations leading to the same general results), we get the 
following table: 
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It can be seen that the player who bets on heads, for example, has I chance 
in » of winning three games. 3 chances in 8 of winning two games, 3 chances 
in 8 of winning one game, and I chance in 8 of winning none. 

In the same way one could construct analogous tables for four games. 
yhi e shall give only the second of these two tables, namely. 
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CHAP. 1 


It can be seen that the expression mathematical expectation must be 
regarded as a s{>ecial term having a well-defined meaning; one cannot look 
for iu meaning by referring to the usual meaning of each of the two terms 
expectation and mathematical. In the ordinary sense of the word, Paul has a 
certain expectation of obtaining 51,000. If this expectation is not realized 
he receives nothing at all. and in any case he will never receive 5230, the 
amount of his mathematical expectation. In spite of this, there are certain 
advantages to preserving the expression mathematical expectation, even 
though It is consecrated by usage; there is seldom any real advantage in 
introducing new terms. 

.A game is said to be equitable if the mathematical expectation of each 
player is equal to his stake. A mathematical expectation of a certain amount 
may thus be traded against this amount, if one can find a player disposed to 
plav an appropriate equitable game. For example, if 1 were to receive 52 in 
the w here a game of heads or tails yields head, my mathematical ex— 
-station IS SI and it can be sold at this price to a player disposed to take 
mv place. In this particular example it should be easy to sell the mathemat¬ 
ical expectation, but this would not be the case if the stake in a game of 
heads or tails were 5100,000,000. Sometimes the commercial value of a 
mathematical expectation exceeds its numerical value; this is because of the 
public s taste for lotteries and the restrictions imposed on them by law. It 
is easv to sell for a dollar a lonery ticket with a mathematkal expectation 
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Similarly, one has 


1 _ 

r\ 

1 

1 

c; 

CT- 

q — 1 

II 

n — q — 3 

cr" 


c; - 

n — 1 

s 



Cl = n 

The last equation expresses the fact that the single head can be in the first 
game, the second game, . . . , or in the last game, and this gives n possible 
combinations. Multiplying the respective sides of all these equations, one 
finally obtains the formula for CJ given above. 

It should also be noted that if one considers the product (H — Tr = 
(H ^ T)(H — T) • . . . * (H — T) and if one multiplies out the product on 
the right-hand side, taking care to preserve the order ol the factors in each 
partiad product, one obtains 2" terms corresponding precisely to all possible 
combinations of n letters which are either H or T. If one then combines 
similar terms, it is clear that the coefficient of equals C®. the number 

of combinations containing q letters H. One thus obtains the binomial 
formula of Newton, namely, 

(H - T)" = H" - ClH"-*T - ... - - ... - T" 

making use of the fact that C® = C"“®. The quantities C® are usually referred 
to as binomial coefficients, and they play an important role in many problems 
of probability. 

It should also be noted that, as a result of the preceding considerations, 
one has 

1 - Cl - Cl - ... - C: - ... - Cr* -1=2- 

since the total number of all fKJSsible combinations is 2". This result ma> also 
be obtained by expanding (1 — 1)" by means of the binomial formula of 
Newton. 

In Sections 3.2 and 3.4 we shall give formulas for obtaining approximations 
of n! and, hence, approximations of the coefficients Cl- These approximation 
formulas are all the more useful when n is very large. When n does not 
exceed 30, it is preferable to use a table giving the values of loe n': such a 
table may be found in most tables of logarithms. 


1.9 Certain Paradoxes 

We could end here our exposition of the essential principles of the theory 
of “Heads or Tails.” Once these principles are established, all consequences 
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equal to his stake. If we now suppose that Paul plays against the same 
opponent Peter an indefinite number of games, Paul could, without waiting 
until the next day, consider the game as interrupted after each favorable 
series and start recording again from that moment on. Since nothing dis- 
tinsuishes one moment from another and since one can always suppose 
that the same begins at some arbitrary instant, it follows that Paul will have 
an unlimited number of favorable series and. hence, an unlimited gain. 
This would be true, at least, if there were no limit to the duration of the 
same and to the duration of human life. 

Referring to the identical series of games, Peter could reason in the same 
way, and it follows that his gain is also unlimited, provided the game is 
played for a sufficient length of time. We have thus arrived at the absurd 
result that each player will realize a gain which increases with time. Clearly, 
this contradicts the original argument according to which some thousands 
of games would suffice to reestablish the balance between the two players. 
Let us be satisfied, for the montent, with having put the reader on guard 
against the drawbacks of a certain kind of reasoning. We shall study this 
question in more detail after we have acquired the necessary theory . 
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3.2 Stirling's Formula 

The follouing formula, due to Stirling, is most often used in practice to 
approximate «!; 

n! = \ 2»rn(l — c.) 

In this formula,* n is an arbitrary whole number, e is the base of natural 
logarithms ie = 2.71828 .. ?r is the ratio of the circumference to 
diameter of a circle (rr = 3.14159 ...), and c, is a number which varies 
with n. but which tends to zero when n increases mdefinitelv. In certain 
applications it is useful to know that n - e, tertds to I/I2 when n becomes 
large; in that case one lets 12n • €, = I — 9,. 


3.3 Application to the Etjuality of Players in Heads or Tails 


To give an immediate application, let us calculate the probability that two 
plavers are even after having played 2n games of heads or tails, namely, that 
there have been n heads and n tails. According to what we have already 
show n, this probability is given by 
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If we now use the formula of Section 3.2, we obtain 
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w hich can be written as 
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where 1 — tends to 1 when n approaches infinity, that is, e, approaches 
zero. It can'be seen that this probabilitv decreases when n increases, but 

that it decreases more slowly. , 

To measure the degree of approximation for small values of a, let us refer 

a 2 ain to Pascal's triangle. For n = 5 one has 
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approximately 1 40. 

• Derivations of this formula mav be found in more advanced l«ts: howewr. one can 
appN it without knowing bow it b proved. 
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SEC. 4.2 


THE LNIT DE'‘I».TIUN 


•:5 


Let us continue our calculations with the approximate formula, assuming 
now that < nr; omitting all terms containing nr with k 2 in the 
denominator, it follows that 


. 1 1 ^ 
log p = — - • log m- — - 

2 m 


and, hence, that 


P = ^=re-*'-(l 

X 

where and ij* are numbers which tend to zero when n is increased 
indefinitely. 

This formula generalizes the result obtained in Section 3.3 for h — 0. and 
it is of great importance. Let us stress the fact that it follows anahtically 
from the formula for combinations, and that it does not entail any other 
assumptions. 


4.2 The Unit Deviation 


It is customary to modify the above formula for p bv making the substitution 
h = /\ nt, where h is called the deviation (from m). The quantity \ m is 
called the unit deviation* and x is called the relative deviation. If we make 
this substitution, we obtain 

P = - r,J 

\ "I* 

and it can be seen that the exponential term deponds onlv on the relatise 
deviation /, which is essentially the reason for making this substitution. 

hen h and m are both very large, one can vary h by several units without 
appreciably affecting ).. Thus, let us designate by and h. two whole num¬ 
bers that are fairly close, and let us Took for the probabilitv that the 
deviation h is contained between A, and h^. More precisely, let us look for 
the probability that < h < tu. If we write A, = ;.,n m and A, 
assuming that /.^ and x, are so close that thev can be replaced, without 
error, by some number x in the exponential e-\ the desired probabilitv 
equals the sum of as many terms as there are values of h. Since these term's 
are all equal, the value of the probability is 

P = (A, — A,) - j— - e~^*{ 1 — 
which can also be written as 
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SEC. 4.3 


USE OF A CONTINUOUS VARIABLE 37 


4.3 Use of a Continuous Variable 

We have seen that the probability that the relative deviation ?. is contained 
between and can be approximated by the formula 

(1) P = e-"* 

yj-n 

where the relative deviations and Xj are given by the formulas h.^ = m 
and A, = Xyyi m. Since and A, must be whole numbers, it follows that X^ 
and X^ cannot take on all values on a continuous scale. 

However, when \//w is large, the values of A, which vary in steps of size 
may be looked upon as a continuous series, and one is led to replace 
formula (1) by the following formula for P: 


(2) = 

y‘lT Ji, 

It should be noted that by virtue of the Law of the Mean the value of this 
integral is actually given by formula (1), where A is a certain number between 
Aj and Aj. The advantage of formula (2) is that it does not require the assump¬ 
tion that Aj and A, are close together, for if one assumes that Aj < A, < A, 
and puts 


Pl.t = f e dX and P* , = ^ f dX 

yjTT Jii 

one has 

Pl.t + Pt.t = ^ f 

y/TT JAi 

This can be generalized to any number of successive intervals of the vari 
able A. 

The probability P that A is contained between —a and a is given by 


Pa = 



dX 


and when a increases indefinitely, the probability tends to the limit 


P^ = 




dX 
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It can easily be verified that the value of this integral is 1. If we put 
^ i f" e-'* dx = Xr dy 

TT J —X V ^ •/—® 

= 1 I * p e-'-»'* dx dy 

7T J —X J — cc 


we have 


and, after changing to polar coordinates and integrating over the entire plane, 
we set 


y= = lfV'’rdr.rja=l 

TT Jo Jo 


It is clear that if we use the exact formulas instead of the approximate 
formulas, the total probability that the relative deviation assumes one of 
the values within its domain must be equal to 1. But if we put 





then h can vary from —w to m, since m -r h (the number of games w^) can 
vary from 0 to 2m. It follows that / can vary from —\fm to +\/m and, 
hence, from 0 to m. Now, when m is sufficiently large, e"** becomes negli¬ 
gible w hen /? exceeds m; for example, if m = 100, e~”' is less than 10^. Such a 
quantity is negligible in most practical calculations; and consequently, one 
does not commit an error in extending the integral from — oo to oo instead 
of limiting it to the interval from —m to 4-m. It is remarkable that the 
value of the total probability thus obtained equals 1 even though the form¬ 
ulas used are only approximate. This shows that the small errors committed 
by using the approximation of Section 4.1 are mutually compensating. As 
we shall see, this is very important insofar as the application of the formulas 
is concerned. 


4.4 Definition of the Function 6(/.) 

It is customary to designate by e(Z) the probability that the relative deviation 
is contained between —A and -I- A; hence, one writes 


1 * 

0(A) = — dA 

yjn- J-i 

or, equivalently, 

»(;,)= A r.-d. 

yjTT Jo 


snecial tables of the function 0(A) have been calculated, and they are 
vttd^y te!. rapplieattons of the ea.eu.us of probability. Some values of 
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SEC. 5,4 


SOMt 4^^ 


PROBLEM 21. Referring to the inequitable game of Prohlen} 19, for /;(h: nainy 
frames is the loss which Paul has I chance in 4 of' exceeding a nniximuni 


Assuming that the relative deviation (in a specified sen^e) equaK the 
median deviation, we satisfy the conditions of the problem, since ilie lour 
hypotheses 

-oc < /. < -0.4769, -0.4769 < /. < 0 

0 < ;. < 0.4769, 0.4769 < /. < x 

are equally probable; each of them has a probability of I 4. Corresponding 
to this value of 7., the number of games won by Peter and Paul are. respec¬ 
tively. 



and Paul’s loss is 

2).\ 2npq — 2 a/? 

provided that this quantity is positive. It will become negatise when n is 
sufficiently large; that is, the longer the game is continued, the belter will 
be Paul’s chances of being ahead. The ma.ximum of the preceding quantity 
can be obtained by equating to zero its derivative with respect to n. If we 
first let p = q = 1/2, the quantity reduces to * 

7.\ 2/1 — 2a/j 

and we obtain 


Solving for n, we get 

„ _ il _ (0.48)- 
8a- 8a- 

in which case Paul’s loss is 


o 

/■ 


\2n -2xn = — 
4a 


(0.48)- 

4a 


Supposing as in Problem 20 that a = 0.25 36.5 and 
36.5 by 0.5 and 36, respectively, we obtain 


replacing 0.48 


and 


n 



648 


p = (0-^)' _ 36 
4a 4 


= 9 


It will be left to the reader to 
appreciable error. 


verify that these simplifications do not introduce an 
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altogether. Letting 

hi = IfiiPiqi and 


we shall now look for the probability that the de\iation h has a specified 
value, or rather that it is contained between h and h — dh. This can be 
realized with an arbitrary value .v of the deviation in w hich case one 
must have 

h < hi — /zo = A' — /zo < h — dh 


or 


h — A' < /z^ < /z — A' — dh 


If we let h — X = ?,\ A/z^/^o^o^ ^he probability that the deN iaiion h., is 
contained between the above limits is given by 

1 . 1 
—e '• d/. = - ~ e dh 

^77 ^ ^TT/Z2^2^72 

This is the value of the probability corresponding to the \alue .v of the 
de\iation In order to get the total probabilitv one must take the sum 
of the products obtained by multiplying these partial probabilities bv the 
corresponding probabilities for .r. Since the probability that .v is contained 
between .v and x — dx is given bv 


it follows that the probability that A is contained between h and h - dh is 


dh -^_ 

~ = - e J.v 

To simphty this expression, let us first evaluate the following integral; 




Making use of the identity 

- bx -c = a(x~ A j" _ 


and letting 


_a 


i ' = ^ a I .V- ^ ) 

' 2a' 


and 


M- = 


we find that the integral becomes 


4a 

4ac - b- 
To 


~(nx'—bx—c) 


dx = 
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-= e-'-'-'^-dv^ 

V a 


e ■' 


dv 


a .-X 
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CHAP. 6 


6.2 The Human Perspective 

VSe all know from everyday experience that people ignore certain small 
probabilities, even though the corresponding events may be of great impor¬ 
tance, say. a matter of life or death. Accident statistics for cities like Paris 
show that there is about one fatal traffic accident per day for each million 
inhabitants. We can thus accept one millionth as the probability that a 
Parisian who goes out into the street on a certain day will have a fatal acci¬ 
dent. If Parisians did not neglect such a probability, they would refuse to go 
out and the life of the city would be paralj'zed. 

One can give many other examples to show that a probability equal to 
or less than 0.000001 is generally regarded as negligible from the human 
perspectise. In spite of this, some people will buy lottery tickets to which 
they are attracted by the size of the grand prize, even though there may be 
millions of tickets. How ever, even in that kind of situation the lottery would 
have few clients unless it offered some second and third prizes: most sensible 
human beings w ill carefully consider their slim chances of winning the grand 
prize, if an enterprise requires a good deal of capital. 

6.3 The Terrestrial Perspective 

The number of human beings living on the earth is approximately two 
billion, and an accident whose probability is very small for an individual 
can nevertheless happen quite frequently. This happens, for example, in 
the spread of a contagious disease; the probability of catching the disease 
on a given day is very' small for a given individual, but if we consider a country 
as a whole, it can happen often enough to justify the implementation of 
measures designed to suppress or at least diminish the risk of contagion. 

Insurance companies, w hose clients number in the hundreds of thousands 
or millions, account for this in establishing their rates, even though the 
risks mav be very slight for each individual client. 

If then after having accepted a probability of 10^ as negligible from the 
human perspective, we divide this figure by IC^*, we conclude that a probability 
less than 10“’* is negligible from the terrestrial perspective. 


6.4 The Cosmic Perspective 

To evaluate what probabilities are negligible from the 

that is the perspective of our universe, one might assume an anthropocentnc 
point of view, r^elv, one might imagine the universe in the image of om 
manet and oJr solar svstem. The star closest to the s^ |S at a 
approximatelv three hght years. If one 

visible universe as three billion light years, one would have to agree that to 
nu^r of srars in this universe is of the magnitude of a bilhon cubed, that 
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is, and that it would thus contain at most 10®’ planets like the earth. 
A probability negligible from the cosmic perspective would thus be given by 
the quotient of the probability negligible from the terrestrial perspective 
divided by 10®*, which is 10~^^ 

It is not necessary, however, to take such a naive anthropocentric point 
of view. It would be more logical to turn our attention to natural phenomena 
which can occur in the universe, without having to compare them v\ ith phe¬ 
nomena requiring the existence of the human race on our planet. The 
smallest particles into which matter seems to be divided have a linear dimen¬ 
sion greater than 10“^ times the billions of light years which represent the 
linear dimension of the universe. The number of these particles would be 
less than the cube of 10'**’, or 10’“, even if they were assumed to occupy 
space without a void. On the other hand, the most elementary phenomenon 
such as the collision of two of the particles occurs (for a given particle) in 
an interval of time that is very small compared to a second. Nevertheless, 
fewer than 10^® such intervals elapse in the course of billions of centuries, 
so that the total number of these infinitely small elementary phenomena does 
not exceed 10’®® in the entire universe and during the longest period of time we 
can assign to the duration of our solar system. It is thus impossible to 
imagine that the simplest event could recur more than 10’®® times, and it 
follows that a probability of 10 “® is very largely negligible from the cosmic 
perspective. One can say with certainty that a phenomenon to which such 
a probability can be assigned has never been observed and will never be 
observed by any human in the entire universe. 


6.5 The Supercosmic Perspective 

The study of very elementary and very simple problems of probability can 
easily lead to the definition of probabilities much smaller than those nettli- 
gible from the cosmic perspective. We shall say that such probabilities arc 
negligible from the supercosmic perspective, and that the events correspond¬ 
ing to these probabilities may be regarded as rigorously impossible 
The probabilities we shall consider here are defined relative to books 

Sup^sc that an urn contains a number of balls, one for each letter of the 
French alphabet (and, perhaps, also one for each punctuation sieroroth^ 

noL^thelettTrrtT^nL^TrittnT^hrb^rt^^^ 

What is the probability tLt 100^! . ^ 

text, say, an Alexandrian verse, a trage?y“brLcine o“r“he“ '’''‘‘i"""’''’''* 

of Victor Hugo7 If. for the sake of L'plic^ty. we U ouVXfto 
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letters of the French alphabet and neglect all additional signs, we would 
have to draw the follow ing 37 letters to obtain the first line of Atha/ie: 

ouijeviensdanssontempleadorerleternel 

The probability that the first letter is an o is 1/26, the probability that the 
second letter is a u is 1/26, . . ., and the probability of obtaining the above 
sequence of letters is thus 26“^, which is less than 10-“ If, instead of 
considering a single line, we considered a poem of 1,000 lines, the correspond¬ 
ing probability w ould be less than 10-“ “®. If one considers a library of a mil¬ 
lion volumes, where each volume contains on the average a million letters, 
the probability of reconstructing these volumes exactly by means of random 
draw ings w ill be less than a pow’er of ten whose negative exponent, in absolute 
value, exceeds a thousand billions. This is the probability of what 1 have called 
the “miracle of the dactylographic monkeys”; namely, the probability that 
thousands of monkeys, randomly typing on typewriters, reproduce exactly 
the entire National Library. 

Everyone will surely agree that such a miracle is absolutely impossible. 
Nevertheless, men of science and mathematicians in particular are accustomed 
to distrust assertions based on judgment or common sense. They do not 
forget that during the course of time, even quite recently, common sense 
denied the existence of antipodes, or the movement of the earth around the 
sun. It is thus indispensable to examine what might be called the argument 
of the mathematician. 


6.6 The Argument of the Mathematician 


This argument is very simple and of a purely arithmetical nature. Let p be 
the probability of an event, where is a nonzero positive number which can 
be very small. Given such a number p, how often must one repeat the experi¬ 
ment to be almost certain that the corresponding event will occur ? If the expen- 
ment is repeated n times, the probability that the event will not occur even 
once is given by P = (l -/>)" according to the principle of composite 
probabilities. Furthermore, if p is very small one commits a negligible error 
if one replaces 1 — p with e **, and this gives 

P = e-"" 


If ,he product pn is sufficiently large, say, 1,000, the probabi tty ^ ^ 

^xtrerndy sntaU and there will be an almost absolute certamty that the 
event (whose probability is p) will occur at least once. It will thus^ffiw to 
repeat the experiment with the dactylographic monkeys n times to P” 
cX certain that the ••miracle" will occur at least once, that is, that he 
monkeys w m reproduce, without a single mistake, all of the volumes of the 

’^Th°eXcXuXions are incontestable, as are all calculations that do not 


k| - -- 

- 
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entail any approximations. Nevertheless, it is not only contestable but ab- 
surd, if one tries to imagine the concrete significance of the abstract uords 
in the phrase “it will suffice to repeat the experiment n times,” \vhen n equals 
a power of 10 whose ex|>onent exceeds a thousand billions, namely, a number 
having more than a thousand billion digits. In fact, we have seen that il vse 
turn our attention to the most elementary and the most simple of exp>eriences 
which are by far the most frequent (such as collisions of infinitesimal par¬ 
ticles, the ultimate elements of matter and energy), and we do so over an 
interval of time exceeding by far the life of our solar system, the number of 
times that an experiment can be repeated does not exceed a power of 10 
whose exponent is less than a thousand. The innocent phrase “it will suffice 
to repeat...” is as ridiculous as it would be for a Christian to say, “It 
suffices to be God the Father.” 

Some philosophers will ask whether the universe is not really infinite; but 
if this is so, all of the most complicated combinations such as the arrange¬ 
ment of all the atoms which at this hour actually constitute the terrestrial 
sphere are not only possible, but are realized an infinite number of times 
even though their probability is very small compared to that of the miracle 
of the monkeys. But again, one must assume that in this universe w hich we 
try to imagine there exists a certain homogeneity, in the sense that the simple 
elements which constitute our universe are able to form the same combi¬ 
nations throughout the entire extent of the infinite universe- There is really 
no need to go into the unreasonableness of such an anthropocentric picture 
of an infinite universe. If we designate by f/| our universe w hich extends as 
far as some billions of light years, and (according to Boltzmann) by t, a 
universe which compares to L\ as f/, compares to an atom, by t'j a universe 
which compares to as compares to t/|, and so on, we would be forced 
to admit that we could never know anything about C'j, even if we knew L\ 
perfectly. In particular, the hypothesis that is composed of a great 
number of elements analogous to Ui is extremely improbable. All specu¬ 
lations concerning C4, (/g,.... and even more so concerning i\ and 
^1.000.000 2re pure metaphysics and not science. Evidently, there is nothing 
to prevent a metaphysician from asking whether the particular combinations 
connected with what we call “life,” which are undoubtedly very improbable, 
occur only a few times or if, though extremely rare, they occur infinitely 
often. Once one admits the concept of infinity, one might as well assert that 
infinity is sufficiently copious to make all this possible. Of course, this 
attitude is entirely contrary to the scientific spirit, which wants to gam 
knowledge about the universe as it is, and not as it could be. 

6.7 Application to Repeated Trials 

The application of the law of chance to the problem of repeated trials 
permits the derivation of certain results which are practically certain (from 
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the humin perspectivei or c\en absolutely certain (from il^ peispec- 

mei. ^^e kno» that the probability that the absolute \aJne of the n^ative 
deviation exceeds a b I — 6|iK whidi b 10"“ for i = 5 and of the order 
of 10“* for i = 10: for i. = 4.06 it b 10^. Thus, values of i. exceedii^ 4 
are seldom observed and values of i exceeding 5 are practkaDy unobserv¬ 
able from the human and even from the terrestiial petspecdve. So far as 
values of i. exceeding 10 are coaceroed. they are impossible frmn the twcmir 
perspective. 

Given 1.000.000 repeated triab for ahicfa^ = ^ = I/*!, the unit deviatioob 
N 2np^ = \ 500.000 or appro.ximatdy 700. Thus, a deviatioa of 2,800 b 
very excepciooal aiKl a deviatioa of 3,500 b practically impc^sible; so far as 
a deviation of 7,000 b cooccmed. h must be considered as rigorously im¬ 
possible. The number of successes aiO practically always be coamined 
between 49~.000 and 503.000. and it b absolutely certain that it will be 
contained between 490.000 and 510.000. Anodpatii^ the theory of the 
probabrbty of causes (Part llll. one might conclude from Uib. for example, 
that since one has observed 51 per cent male births among millions of births, 
it b absolutdv certain that the respective probabdities for male and female 
births are not equal: if tbcv were equaL such a large deviatioo wtjuld be 
impossible. 

The law of chance also clarifies the law of large numbers for aO c a s e s 
where the number of triak b fi xed. The number of successes must cenaiidy 
be contained between mp - Inpq, and hence t he pr oportion of successes 
must be contained between the limits p — S\ 2pq n. It must be added, 
however, that ihb result applies only to a series of weD-determined trials: 
that IS. the number m must be regarded as gnen and the n trials must actually 
be realizable (which would not be the case ifu exceeded 10»'^ The study 
of an infinite sequence of repeated triak b a different matter; ft b 
thcofetkal interest, since an infinite sequence of triak cannot P<»s ib^ be 
realised This is connected with the theory of dcmuncrablc prohaba^ 
which we shall not be able to discuss in detafl. Ncv^erthelcss, it wdl be 
appropriate to sav a few words on thb subject, because of the importance 
it has assumed in modern theoretical research in probability. 


6 S 4 5>mpie FroNem of DtmomenMe Frobabiiaies 

The foBowine » the simplest problem in the ^ 

^r.r^ne an infiniT^uence of repeated triak 'b.^ 

I 2... (thit IS. them « a denumerable ^ 

a success in the uth trial is VIhat k the probabd.^ ^ 

inlimtelv manv successes? Tbc answer to 

and It depends onh on the convergence or divergence of the senes 


(U 


S = Px~Fy 
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If the series S converges, one is said to be in the case of convergence, and 
the probability F that there will be infinitely many successes is equal to zero; 
if the series S diverges one is said to be in the case of divergence and the 
probability F is equal to 1. 

Complete proofs of this may be found in other books. The abbreviated 
version we are going to give here is equally rigorous, however, at least for 
those who are familiar with the concept of mathematical expectation. 

If the series (1) converges, its value is less than some integer a, and it may 
thus be decomposed into b partial sums (one of which is the sum of infinitely 
many terms), where each partial sum has a value less than a, and where h 
is an integer which may exceed a, but which is less than 2a. 

On the other hand, if the series (1) diverges, it can be decomposed into 
infinitely many partial sums, each of which has a value greater than 1. 

Let us now concentrate on one of the partial sums 

s = Pi -r pz ~ -T Pk 

whose value is less than 1 in the case of convergence and greater than I in 
the case of divergence. The mathematical expectation of a player who 
receives an amount equal to unity for each success in the A: trials is evidently 
equal to s. It follows that there would have to be on the average less than 
1 success when s is less than 1, and more than 1 success if j is greater than I. 
Consequently, in the case of convergence a success will occur on the average 
at most b times, and in the case of divergence a success will occur on the 
average infinitely often. This is precisely the result we asserted above. 

It is known that the boundary line between convergence and divergence 
cannot be given precisely by means of a denumerable infinity of criteria, 
but it is also known that the logarithmic criteria of Bertrand make it possible 
to form convergent and divergent series which are remarkably alike, at 
least for all the terms one can actually write. 

To formulate such series, it will be convenient to designate by log .v a 
number equal to the logarithm to the base 10 of x when .v is greater than 
or equal to 10, but equal to 1 when jr is less than 10. With this convention, 
the iterated logarithms 

log (log x) = loga X 
log (loga x) = loga jr 

are defined for all integral values of jc and they are at least equal to 1. 

Let us limit ourselves to the simplest criteria of Bertrand: the series 


n • log n(loga n)* 

converges, whereas the series 


n • log n log* n 

diverges. However, if n is less than 10“ the corresponding terms of the two 
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where /I is a numerical constant whose value does not matter. If one puts 

(4) K = N^2 1og/i 

where log designates the Naperian logarithm, formula (3) gives 

(5) P. < 

and we find ourselves well in the case of convergence. One can thus assert 
that the probability for infinitely many successes is equal to zero; this kind 
of success can occur only a finite number of times.* In any case, one can 
assert that beginning with a certain value of n, the relative deviation will 
be less than v2 • log/i, which goes to say that in a series of n repeated trials 
with the constant probability p = \ — q, the number of successes will 
definitely be contained in the interval 

(6) np ± V 2 • log n \2npq 

starting wdth some value of n. Correspondingly, the proportion of successes 
will be contained in the interval 

(7) p ± VW l'°8 " 

y/n 

where log designates the Napierian logarithm. 

Formula (7) expresses the fact which Kolmogorov and other mathema¬ 
ticians have called the strong law of large numbers. It can be seen that it is 
an immediate consequence of the fundamental theorem of the theory of 
denumerable probabilities. 


6.10 Conclusion 


It can be seen that the domain of applicability of the law which we have 
referred to as the law of chance is very large; the law covers practically all 
problems of discrete probabilities one can effecUvely pose. 

At times, the theory of denumerable probabilities seems to conflict with 
the law of chance, but one must never forget that it is a purely mathematical 
theory, and that it has no practical basis since it assumes the realization of a 
denumerable infinity of trials. At the end of Part II, which is devoted to 
continuous probabilities, we shall see that such a denumerable infinity of 
tnals is equivalent to the choice of an element (a point) in a continuous set. 
pws presents another point of view, from which one can view the theory of 
denumerable probabilities and, as a consequence, the law of chance. 


• Using the reasoning based on mathematical expectations employed in Section 6 8 it 
can easily be shown that this finite number is certainty veiy small.^^ ’ 
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and corresponding to A: > 1 2 (Figure 7) one has 

.4A' = k/, .r/f = HL = (I - ky, PL = l, L.\ = l(2k-l) 

In both figures, the shaded area equals the rectangle AA'B'B — the triande 
A’B'I — the triangle LAY, namely, 

AB - .4.-1' — - {A’B f — - (LA*)* 

4 4 

However, since L.V equals —/(I — 2L), one has 

AB - AA' = Pk 
(A'Bf = P 
(LSf = P{1 -2ky 


for both figures, and the probability (obtained by dividing the shaded area 
by P) is thus 


k — 


1 - (1 - 2k)- 

4 


= 2k —Ir 


The following is another way of arriving at the same result; letting AM = x 
and AM' — v, the probability that AM is contained between .x and x — dx 
and M' is contained between y and y — dy is equal to dx dr P. Representing 
the point P bv its rectangular coordinates x and y (see Figure 8), we find 
that this point must be inside the square OABC of side /, and the probability 
that it will fall in a given region must be proportional to its area. The 

condition imposed by the problem is that 
j-f _ y| < kl, which requires that P must be 
contained between the lines DE and FG, 
parallel to OB, whose equations are 

x-y=-kl 

The desired probabiUty is thus equal to the 
ratio of the area of ODEBGFO to the area of 
the square, and since the two right triangles 
A DE and CFG form a square of side (I — kd, 
since OD = OF = kl, it equals 

r--(\-kn _ 2Jt _ It* 

nOXTRE 8 ” 
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8.4 Problems Concerning the Position of a Point on a Circle 

PROBLEM 24. Two points Af and M' are randomly selected on the circum¬ 
ference of a circle. What is the probability that the smaller of the two arcs 
AIM' is less than x, where x a n? 


This problem is easier than the preceding one, since one can arbitrarily 
fix the position of M without affecting the probability. Thus, M' must fall 
on an arc of length 2x whose midpoint is at M, and the desired probability is 

^ _ a 

277 77 


The probability that AfAI' is contained between x and x + dx is thus dxjn, 
and the expiected value of MAI' is given by 



PROBLEM 25. If n points AI^, AI ^,.. . , and AI„ are randomly selected on the 
circumference of a circle, what is the expectation of a player who is to receive 
as many dollars as there are arcs M.Mt l^ss than x? 


We have already seen that the probability that one of the arcs is less than 
X is x/77. Thus, if there are n points the number of combinations (of n objects 

taken two at a time) is ^, and the mathematical expectation is 

n(n — \)x assumes that x equals one degree and if one lets n = 25, 

2^^ 25 • 24 1 _ 5 

this expression becomes —-— • ^ • 

The mathematical expectation is greater than one, although the probability 
that at least one of the arcs is less than one degree is less than one. This is 
accounted for by the fact that the player will receive an app^iable amount 
(perhaps as much as 300 dollars) when many of the arcs are less than 

one degree. 

8.5 Problems Concerning the Position of Points in a Plane Region 

PROBLEM 26. Two points A1 and M' are randomly selected i"^‘de a square of 
side a. What is the expected value of the square of the distance MAI . 

Designating the coordinates of A/ by x and v, ^ ^ 

v', we have only to evaluate the following multiple integral 

1 r- p r- r _ x*)* + o- - /)*i ^y’ 

a* Jo Jo Jo Jo 
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In successive steps one obtains 

f [(X - X'f ^ 0--/)*] 
Jo 

and the desired mean value is a^j3. 


x'r - x'^* , 


- a(y - x-r- 


- o\y - yr 


8.6 Problems Concerning the Position of Points on a Sphere 

These problems are of special importance for, as we shall see in Part 111, 
they tie in with various questions concerning the distribution of stars on the 
celestial sphere. 

PROBLEM 27. Two points M and M' are selected at random on the surface of 
a sphere. What is the probability that the smaller of the great circles MM' 
is lesf than a ? 


This probability is the same regardless of the position of .M and, hence, 
if M is fixed M' must fall on the curved surface of the segment which corre¬ 
sponds to a semicentral angle MOA = % (see 
Figure 9). Designating the radius of the sphere by 
Ry one thus has MP = OM — OP = R{\ — cos x), 
and the ratio of the area of the curved surface of 
the segment to the area of the sphere is 

MP _ 1 — co s X 
2R ~ 2 


. 2 * 
= Sin - 



FIGURE 9 


This is the desired probability. If x is very small 
one can replace sin a/2 by a/2 and approximate the 
probability by a*/4. 

In connection with the method just presented, 

Bertrand indicated an alternate kind of reasoning 
which leads to an entirely different result. Given two points M and M', the 
pat arcle joining them is determined, and since all great circles on the sur- 
face of a sphere are equivalent, the probability is not affected if one fixes the 

the probability that two 

pints on a arcle are such that MM' is less than a equals a ir, and this differs 
from the result obtained before, especially if a is v^ small. If a is 1 degree 
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Letting x = l}\ this integral becomes 

2/ r ^ 

— I arc cos y ay 
ATT Jo 

and the desired probabUitv equals atr*2A since the integral equals 1. To 
verify this one has only to put v = cos i and one obtains 


j* arc cos y dv = i — t * sin t dt — (r ■ cos t sin n, . 


= 1 


In particular, if one assumes that 2J = a, namely, that the length of the 
needle is half the distance between the parallel lines, the probabiliu becomes 
equal to l/ir. Thus, if one performs a sufficient number of trials, one obtains 
an approximation of the value of This has been tried many times and it 
ha<; given perfectly satisfactory^ results, as satisfactory as it would be to 
estimate the ratio by means of drawings from an urn containing 314,159.265 
balk of which 214,159,265 are black and 100,000,000 are w hite. For 4.000 
trials the unit deviation is obtained by putting n = 4,000, /> = 1 tt, and 
^ = (tt — l)/iT, and one gets 


^ 2npq = 


^ 8,000(ff — 1) 

TT 


which is a little greater than 10. Supposing that the value is 10, the error 
for I/tt would be in the neighborhood of 1/400 and the error for — w ould be 
in the neighborhood of 1/100. Thus, the first decimal should be correct and 
the second decimal should be within one unit. This is the degree of 
approximation one can expect for 4,000 tosses of the needle. Such a result, 
repeated several times, w'ould suffice to confirm the validity of the principles 
involved and to indicate a mistake in a method w hich would lead to a result 
appreciably different from 1/w. 

A rather ingenious argument, due to Barbier, enables one to calculate 
without difficulty' the mathematical expectation of a player w ho receises 1 
dollar for each point at which the needle crosses one of the parallel lines. 
When / < a, he cannot get more than one such point and in this case the 
mathematical expectation is identical with the probability. 

Let us consider an idealized object consisting of a broken line (open or 
closed) which is throwm at random on the parallel lines. It is clear that the 
mathematical expectation of a player who receives 1 dollar for each point 
of intersection is equal to the sum of the mathematical expectations for the 
individu^ sides of the polygon, which in turn equal the sum of the analogous 
expectations for the equal segments into which one can divide the sides 
(supposedly commensurable by the usual classical reasoning). It follows 
that the ^sired mathematical expectation is proportional to the length of the 
broken line. Now, if this reduces to a circle of diameter 2a. the exp*ectaiion 
equals 2, since there are always 2 and only 2 points of intersection. The 
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Calculating the difference between the ratios and p„ one obtains 

,_ Pn~\ Pi _ 

Pn Pn I „ I I ^ 

-r ^2 + • • • + fln 

and this quantity becomes smaller and smaller when n becomes large. 
Designating this difference by we now have 

p„ < 1 < p'„ and Pn — Pn = 

It follows that each of the differences p'^ — 1 and 1 — /?„ is less than 
The same reasoning applies also to the decreasing portion of the curve and, 
hence, to the total ratio. Thus, the required probability equals 12; this 
result is approximate under the given assumptions, but it is to a large 
measure independent of the choice of the arbitrary function. 


PROBLEM 33. If one constructs a table of a function <f {x) for values of x 
increasing by very small steps {say, for x = 0, 0.00001, 0.00002, 0.00003, etc.), 
what is the probability that the second decimal of an entry in this table is a 7 ? 


Note that the problem refers to an arbitrary function q{x), which need 
not be given. Making use of the law of the mean one has 

<p{x^ — <f{Xi) == (Xa — 


where f denotes a number between and x^. For example, in order that 
the second decimal be a 7, it is necessary that 9:(,v) be contained between 
1.37 and 1.38.* According to the preceding formula, the interval of varia¬ 
tion for x is thus 


O.OI 

.X, — x, = - 

v’{^) 

Assuming that x varies by intervals of 0.00001, we find that the number 
of values of x contained in this interval is given by the integral part of 
l,000/9:'(^), where f is such that ^(1) is contained between 1.37 and 1.38. 
Let us denote it ^37. The total number of values of .x for which the second 
decimal digit is a 7 is thus given by 


^ 1,000 1,000 1,000 

’ <^'(^27) <F'(^37) ^ ■ 

where the number of terms is determined by the extent of the variation of 
the function 9? within the table. Similarly, the number of values of .x for 
which the second decimal digit is an 8 is given by 


1,000 ^ 1,000 ^ 
y (fis) ^ 


to four decimals, for example, the 
1.J6995 and 1.37995, this makes no appreciable difference. 
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The ratio of the probability p. that the second digit is a 7 to the prob¬ 
ability that the digit is an 8 is equal to the ratio but if one puts 


= d)(.x) 




one has 


£7 ^ 

-^8 2 < 1 >( I 38 ) 


Assuming that <l>(.x) is continuous, reasoning analogous to that used in the 
preceding problem proves that this ratio is very close to 1; the probability 
p- is thus approximately 1/10. 

-As an application one might determine how often a given digit occurs in 
the fifth decimal in a table of logarithms given to seven decimals. One would 
find that the probability for each digit is very close to 1/10. 

9.3 Application to the Distribution of the Minor Planets 

If one wants to investigate how the minor planets are distributed over the 
zodiac, the problem may be simplified by restating it as follows: 

PROBLEM 34. If one considers n points moving uniformly (but with different 
speeds) along the circumference of a circle, what is the mathematical expecta- 
tion of a person who is to receive as many dollars as there are points on a given 
arc ? 


Note that in a given instance the probability that any one point is on the 
arc is proportional to the length x of the arc and, since its motion is uniform, 
the time a point spends on the arc is proportional to its length. The mathe¬ 
matical expectation is thus ctll-n for each point and nxll-rr for all n points. 
One could be satisfied with this solution, which I have also discussed in one 
of my other books, but let us examine how Poincare approaches this subject: 
he looks for the mean of a periodic function of the curvilinear abscissa x of 
one of the points—say, for example, sin x. It is clear that the corresponding 
mean for n points approaches zero when their distribution becomes more 
and more uniform. Now, if one denotes by a^ the abscissa of one at the 
points at the time / = 0 and by b^ its angular velocity, its curvilinear abscissa 
is flt — b,^ and the desired mean value becomes 


- ^ sin (a* -f bki) 


n 


Assumin 
consists of 
number of ^ 
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bt: is contained between y and y — dy, one can write 

2 sin (a* + btt) = JJ sin (x -r /y) 9 r( v, y) dx dy 

n = jj <f{Xy y) dx dy 

where both of these double integrals extend over the entire domain for 
which qix,y) is not equal to zero. The desired mean value is thus given by 


JJ 


sin (x -t- ty)(f(x, y) dx dy 


JJ 


<fix, y) dx dy 


and it can easily be shown that when t increases indefinitely the mean value 
tends to zero. Thus, even if the distribution is not uniform at the start, it 
tends to become uniform. 

This argument has been given in spite of certain objections which I have 
discussed elsewhere. These objections are analogous to those which arise 
in the study of the kinetic theory of gases, and I shall not discuss them here. 


9.4 Application to the Kinetic Theory of Gases 

The study of the motion of n molecules in three-dimensional space may be 
looked upon as the study of the motion of one point in 3n-dimensional space. 
The most important features of this problem may be brought out by studying 
the motion of a point in a space of only two dimensions, that is, in the plane, 
and we shall limit ourselves to this case. 

Suppose that a point mass is moving rectilinearly in the plane and that x 
and j are its rectangular coordinates. We shall look for the probability that 
the integral parts of x and y are even integers; that is, the probability that 
X and 7 are contained between 0 and 1, between 2 and 3, between 4 and 5, 
and so on. 

If we divide the first quadrant into a network of squares of side 1, we can 
shade the squares for which the preceding condition is satisfied and state 
simply that we are looking for the probability that the moving point is 
inside one of the shaded squares (see Figure 15). 

It is worth noting that if one considers a point mass restricted to uniform 
rectiUnear motion inside a square and reflected in accordance with the 
classical laws of mechanics, its movement is entirely analogous to that of 
the point mass we have considered. To indicate the identity of the two 
problems one has only to suppose that one observes the motion of a point 
inside the square by means of mirrors erected perpendicularly on its edges 
Note also that the problem would be the same if we consider a point mass 
inside a cube which is reflected in accordance with the classical laws, and 



98 THE INTRODUCTION OF ARBITRARY FUNCTIONS 


CHAP. 9 


focus our attention on the projection of the point on a face of the cube. 
In what follows, we shall limit ourselves to studying the problem as it was 
formulated first. 

If I and iS are the coordinates of the point at r = 0 and u and c are the 
projections of its velocity on the coordinate axes, its coordinates at time 
t are given by the formulas 

X = X — Ut, y = I t 


.Mthough it is quite interesting in certain respects, we shall not discuss 
the case w here *, u, and r are known without error. Some arithmetical 



properties of these numbers—^for 
example, the question whether they 
are commensurable or not—^would 
enter this discussion. Of course, this 
would not be a practical problem, 
since the assumption that a number 
is known with absolute precision is 
a purely theoretical concept without 
any real significance. In actual prac¬ 
tice, one must regard a, u, and r 
as known only to a certain degree 
of approximation, although for the 
sake of brevity we shall assume that 
X and ^ are known without error; 


this will have no appreciable effect on the results. Thus, we shall designate 
by ( and ij the maximum absolute errors for u and r and write the preceding 
formulas as 


.X = a — u’t, ju' — ul < € 

y = /? - c'r, |r' — rl < i? 


If <n is the point whose coordinates are a and f is the point whose 
coordinates are » - u' and ^ - i'. and u' and r’ assume all possible values 
(that is u’ varies from m — € to u -r c and r' from r — ij to r -b »?), the 
point ^'fills the interior of a rectangle MNPQ, whose sides are of length 2e 
and 2ri, respectively. By assumption, the point | can occupy any position 
inside this rectanele, and we shall assume that the probabiUty that it is 
in a certain region is proportional to the area of that region. Using the 
argument of Poincare (see Section 7.3). it can be seen that this ass'jmpU^n 
IS “not essential, but it nevertheless simplifies the reasoning as well as the 


calculations. j- 4 - u’l 

It is easy to see where the point f', whose coordinates are x -« + u / 

and i = d - r'r is located. This point is homothetic to I, with the center 
of Dfokctliiv bine the point «. and the ratio of homotheticity being etpial 
fo "rn the' So, ofthe reeungle M NF Q-. which U homoUteoc to 
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the rectangle MNPQ in the same ratio. We had to draw Figure 15 so that 
t is only a few units, but it is easy to visualize what happens w hen t equals 
thousands or millions of units. No matter how small MSPQ might be. the 
rectangle M'N'P'Q’, whose dimensions are t times larger, become^, very 
large compared to the squares of the network, and the shaded squares 
occupy nearly one-fourth of its area. The validity of this argument in¬ 
creases with /, since the areas of the squares partially inside and partially 
outside become negligible compared to the total area ol the squares that lall 
inside. 

This demonstrates how time brings about an equalization of probabilities, 
so long as there is an initial indeterminance, no matter how small. For 
small values of t —for example, for the rectangle M'N'P'Q' of Figure 15— 
one might obtain a shaded portion less than 14. This does not equal the 
theoretical value, but there is a tendency among the probabilities to equalize 
when t assumes greater and greater values. It is by means of an analogous 
mechanism that molecules, as one considers them in the kinetic theory, 
tend to arrange themselves in a distribution which calculations show to be 
most probable. One cannot actually demonstrate that such a most probable 
distribution is realized at a given moment, but if it is not realized one can 
say that the practical probability of the existing distribution is zero. One 
can demonstrate, however, that regardless of what distribution there may 
exist at a given moment, the probability that the most probable distribution 
will be realized at a later time is so close to I that it can be put equal to I. 
The same reasoning applies also to prior moments, though with the following 
difference: When one calculates a future state from a present state, the 
present state is not rigorously known, and the small limits of error within 
which the present state is known make it possible to calculate various 
probabilities concerning the future state. After ail, one cannot really speak 
of probabilities unless something is unknown. On the other hand, if one 
wants to reconstruct a prior state from a present state, the uncertainty is 
greater; in the preceding case where we knew the prior state, we learned 
more about the present state than we would have by a direct study of that 
state. For example, if one has two containers, one empty and one full, 
and if one allows their contents to mix by opening a stop, the resulting 
state will not permit any conclusions about a state prior to the opening of 
the stop. It is known with a precision exceeding experimental possibilities 
that the most minute modification in the present state could well lead to an 
entirely different prior state. Thus, it is impossible to define and calculate 
the probability of a prior state by a method analogous to the one used above. 


CHAPTER TEN 


Errors of Afeasurement 


10.1 Formulation of the Problem 

We shall not go into all questions arising in a complete theory of errors of 
measurement, since this would require a book in itself. In particular we 
shall not give any of the details concerning the mathematical calculations 
required to deduce from a number of measurements the value of the quantity 
thev are supposed to estimate. Neither shall we go into the theoretical dis¬ 
cussions surrounding the law of Laplace-Gauss or the mathematical develop¬ 
ments to which these discussions have led.* The imp)ortance of the results 
thus obtained is not proportional to the analytical effort required, and this 
is one of the main reasons for the practical pseudouniversality often attached 
to the law of Laplace-Gauss.t 

Having pointed out this reason, we shall limit ourselves to the develop¬ 
ment of the most important theoretical consequences of the law of Laplace- 
Gauss Amone the three parts which constitute the complete theory, namely, 
the derivation^of the law, its theoretical investigation, and its applications, 
we shall limit ourselves mainly to the second. 


• For a long time this Uw has been referred to as the law of Gauss. Although the law 
was first formuUted bv Laplace. Gauss* research in this area has become classic. 

♦ The following is the main reason why it is best not to go into these quesuons of 
cated^hrtical Masoning (the integration of differential equatiom. 

slight modification in one equation can <*ange tne enure a y 

1(X> 
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10.2 The Law of Laplace-Gauss 

Suppose one wants to measure a quantity, say, a length, whose true \ alue 
is a; if the measurement equals a' the error is o — a, and according to the 
law of Laplace-Gauss the probability that this error i> contained between 
X and X — dx is given by 



where k is a constant called the precision of the method ol measurement 
being used. 

This formulation of the law calls for several observations. First ol all, 
what does one mean by the true value of the quantity that is being measured ? 
Since this value is usually unknown before any measurements arc made, 
is it not a vicious circle to evaluate the errors of measurement with the aid 
of a number furnished by the measurements themselves? We shall not 
dwell on this problem, which can be tied in with the probability of causes; 
let us merely indicate that there are situations where this difficulty does not 
exist. For the moment, at least, we shall limit ourselves to such cases, namely, 
to cases where the quantity to be measured might be known from prior 
measurements (or from theoretical considerations) to a greater degree of 
precision than that provided by the measurements themselves. A situation 
like this might arise in physics or chemistry where numerous students are 
asked to determine a physical constant well known in advance. 

The same can be said for the constant k appearing in the formula of 
Laplace. It is generally known only after some observations have been 
made, and we shall indicate later how' it might be estimated. 

The law of Laplace gives the same probability to positive and negative 
errors, that is, to errors whose absolute values are the same. It is only when 
this symmetry is realized that one can assume that the law applies. 

Finally, let us observe that if .t is replaced by a function of .t the analy tical 
form of the law can be modified. This objection is analogous to the ones 
we have already met and it must be answered in the same way; by the obser¬ 
vation of facts. For instance, if one uses a ruler to measure a length, the 
various parts of the ruler are homogeneous, and it is not the square of the 
length that is being measured, but the length itself. Thus, it is relative to 
the length, and not to the square of the length, that the error pertains. 

Nevertheless, when the errors are small the formula of Laplace preserves 
its form to ^first-order approximation. If we let 

b = 9 r(a), b’ = <f{a) 

x = a - a, y = b' - b 


we have 


y = <f\%)x 
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having two different values, one can regard the two errors as resulting, so 
to speak, from the two distinct causes having distinct probabilities. 11 the 
error produced by a cause varies continuously, one can decompose its 
domain of variation into a number of elements which are sufficiently ^all 
so that within each element the error may be regarded as constant. Then 
one designates by the number of possible errors whose value is and 
whose probability is p^, by n.^ the number of possible errors whose value 
is 02 and whose probability is and so on. The numbers o^, o.>,... can be 
either positive or negative, and one has 

so that the total error is zero if each partial error occurs in accordance w ith 
its probability. This is a consequence of the hypothesis, indispensable if 
the law of Laplace is to hold, that the negative errors are as probable as the 
positive errors. 

In reality, the errors do not occur n^pi times, but n^p^ -f- times, 
where i/j = \ ^hpiqi is the partial unit deviation and the relative deviation 
Xi satisfies the law of Laplace (see Section 5.1), and so forth. One concludes 
that the error is 

and reasoning analogous to that of Sections 5.6 and 5.7 yields 


T72 _ ,,2_2 I ,,z_z t _r_ 

LJ — ~T~ ^2^2 • • • I 

The quantity U thus defined is the unit deviation and the relative deviation 
follows the law of Laplace. It can be seen that the unit deviation U is the 
reciprocal of the quantity k which we referred to as the precision. This 
follows from the fact that the probability for A is 


. 2_2 


1 -i* 

Jit 


dX 


and, hence, that the probability for x = XU is 


Thus, one has 


1 

UyjTT 


dx 


A = 2 

k 

so that, roughly speaking, the precision is inversely proportional to the 
values of the elementary errors and to the square root of their number. If 
the elementary errors are all divided by 10, the precision is multiplied by 
10; if the elementary errors remain of the same size but become 100 times 
more numerous, the precision is divided by 10. These remarks will become 
more meaningful after we have studied the relationship between reality and 
the number k which we called the precision, and which until now has been 
nothing but the definition of an abstract term (see also Section 10.5). 
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To summarize, the law of Laplace is justified by the fact that when the 
possible errors are numerous, their combinations are subject to the law 
w hich we obtained by combinatorial analysis in the case where we studied 
a great number of repeated trials. 


10.4 Invariance of the Law of Laplace 


An a posteriori justification of the law of Laplace may be found in the 
following invariance property: when the errors of measurement of several 
(quantities indiiidually follow the law of Laplace, the same is true also for the 
error of their sum. It will suffice to prove this for two quantities, for which 
the probabilities of obtaining an error between jc and x + dx and y and 
V — dy are, respectively. 


^ -E*x* J J fc' -E'*.* i 

—=-e dx and ^=.e * * dy 
V"- yj'tr 

What is the probability that the error for the sum of the two quantities is 
contained betw'een z and z dzl If the error for the first quantity is equal 
to x, the error for the second must be contained between z — x and z — 
.X — dz, and the corresponding probability is 


yjTT 


dz 


Now, the total probability is obtained by multiplying this expression by 
the probability that the first error is contained between jc and jc + dx, and 
then integrating on .x from — oc to -!-oo. This will give an expression of 
the form y </r, where 

77 J-x 

and, using the results obtained in Section 5.6, this may be written as 



w here the precision k’ is given by 



3 


fc'* 


10.5 The Precision of a Series of Measurements 


Let us now demonstrate the significance of the coefficient k wluch we haw 
referred to as the precision. The probability that an error is contained 


between k/.i and k/.^ is 
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the (juantiw to be minimized becomes 

J__ o*/ n . n(n — I)i 

which equals 



The minimum corresjionds to 


and its value is 


a = 


1 




Thus, if one uses the formula 




one can expect to make the smallest error in estimating the precision. This 
is not of great practical significance: if n is large the two estimates are 
almost the same, and if n is small they are both very uncertain. 


10.6 H eights and Means 

The name weight is usually given to the square Ar* ot the precision Ar, and this 
is justified bv the solution of the following problem. 


PROBLEM 37. Two series of measurements are made to measure a giren 
length z, and their respective precisions are k and k'. Given only one measure¬ 
ment m of the first series arid one measurement m of the second series, how 
should one corvine these two values to obtain an estimate ofz? 


For obsious reasons, one is led to estimate r by means of an expression 
of the form* 

hm -h h'm’ 


• In faa the cxpraskm for x should satisfy the following conditions: (I) dm znim w 
muhiolied bv a constant, x should also be multiplied by that constant: 
exprrsswn should be homogeneous to the first degree; (2) if a 

to m . it shouW also be added to z; in other nxirds, the express^ should be linear 

»«il as homogeneous; (3) if « = m\ : must equal their common value. 
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evidently approaches 1 as it consists of a finite number of factors, each 
having the limit 1. We thus get 



and this is the formula of Poisson. It should be noted that 



since the quantity in parentheses equals e^. 

Regardless of the choice of x > 0, there can be any number of points P 
on the segment AB, and the sum of the probabilities equals 1, as it should. 
It should also be noted that when k is sufficiently large compared to x. P* 
will rapidly tend to zero, and the probability will become negligible after 
a certain value of k. Let us now consider the ratio 

Pjr-i-l “T I 

If X is not an integer, will increase until k equals the integral part of x; 
after that will be less than P^ and this will continue indefinitely. If x 
is an integer, the values of P^_, and P^ are equal and they constitute the 
maximum values of the function P^. 

Poisson’s formula may be applied to problems of repeated trials w hen the 
number of trials is very large, the probability of a success on an individual 
trial is very small, and x designates the total mathematical expectation. 
Without making an appreciable error, the formula of Poisson may thus be 
used when the number of trials is sufficiently large and the probability p is 
sufficiently small. An interesting special case arises when >1=1. For ex¬ 
ample, one might buy one lot each from 100 lotteries, w here each lottery issues 
100 lots, or one might bet 37 times on a given number in roulette, w here there 
are 36 numbers and zero. What is the probability of winning not even once, 
the probability of winning exactly once, exactly twice, and so forth? 
Rounding to as many decimals as show n, we have 

P^ = lie = 0.368 
Pi = Po = 0.368 
Pj = *Pj = 0.184 

^3 = i/** = 0.061 

^4 = kf*3 = 0.015 
= IP* = 0.003 
P* = iPi = 0.0005 


0.9995 
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hen / is sufficienUy large and. for the sake of simplicity, assumed to be 
a v^hole number, one can easily obtain the approximate formula 



and ii can be shown that the law of Laplace-Gauss applies to the deviations 
from 


11.3 The Choice of a Point on a Line Segment 


As has already been pointed out, we are led to say in the theory of discrete 
probabilities that a point is randomly selected on a line segment if the proba¬ 
bility that it is contained in any given intersal is proportio nal to the length 
of that inters al. Without going into the general theory- of measure, let us 
merely point out that as a result of this definition the probability that a point 
(restricted to the interval from 0 to 1) is co'ntained in a set consisting of a 
finite or countable infinite number of nonoveriapping intervals equals the 
sum of the lengths of the intervals. If the intervals overlap, the probability 
is less than this sum. Let us apply this to the rational points p q contained 
in the interval from 0 to 1. The point p q is contained in the interv-al 


P 


X 



w hose length is 2i w here x is an arbitrary positive constant. The number 
of intervals of this form for a given value of q is less than q^ since p is an 
integer less than q and relatively prime to q. Thus, the total sum of the 
intervals is less than 

* r 

and it can be made as small as one wants by choosing i sufiBdentiy small. 
It follows that the probability of randomly selecting a rational number on 
the interval from 0 to 1 is 1^ than any predetermined constant; in other 
words, it is equal to zero. 

The same argument applies to any denumerable set of points on the inter¬ 
val from 0 to 1, for example, to the set of all algebraic numbers. Ooe ato 
obtains denumerable sets bv considering all the numbers defined by an infirate 
series (or continued fraction, etc.), where the law for successive terms is 
<riv en in such a wav that, starting with a given term, they can all be ^ulat^ 
Since all other processes by which one can define a given number can 
written in at most a finite number of words (after all, it is humanly 
to write more than 1(H* words), one arrives again at a 
It foUows that the probabiUty is zero that a 

one which, starting with the integers, can effecuvely y 

process. 
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to permit the applicaUon of the law of chance. For example, one could 
assert that it is impossible for the last n digits (where n is contained between 
50 and 500) to dupUcate exactly a series of consecutive digits appearing 
among the first 1,000 — n digits drawn. If this were the case, one could 
assert that the decimal dcNelopment obtained has a tendency to be periodic 
namely, that it might represent a rational number. Of course, there are 
rational numbers for which the period does not begin until after a great 
number of nonperiodic digits, or for which the period itself covets more 
than a thousand digits. 

.More generally, the number of irrational numbers, defined in a simple 
that have actually been calculated to many decimals is very small. One 
could thus assert with certainu- that the first m digits drawn will not coincide 


w ith the corresponding digits for such numbers as sr, e, \ 2, and so forth. 

Impossibilities like these are brought out even more clearly if one sub¬ 
stitutes for the decimal notation an alphabetic notation, where the digits 
are the 26 letters of the alphabet. In this way, a number which has 1,000 
digits after the decimal point in the decimal notation would be equivalent 
to a number having barely more than 500 leners in the alphabetic notation. 
If the letters are drawn at random, one could assert that one will never 
obtain a text of 5(K) words appearing anv'where in the National Libraiy, or 
even in all newspapiers which have appeared or will appear throughout 
this century . The number of such texts certainly does not exceed 1(F®, and 
the probability of duplicating one of them is thus 10** • 10“^ = 10'^. 

It would be equally unlikely to obtain a sequence of letters making any 
sense w hatsoever. .Among the decimal numbers that one will surely not get 
by v irtue of the law of chance are, for example, those in which 50 consecu¬ 
tive dieits are all alike, those in which the digit 7 appears fewer than 20 or 


more than 200 times, and so on. 

Persons buving lottery tickets often refuse to buy tickets with very un¬ 
usual numbers, say, 555,555. They are under the impression that this numto 
will not win, which is correct in the sense that the probability of its wirming 
is 1 in a million and. hence, negligible from the human perspective. In other 
words, it would be entirely unreasonable to count on winning with the 
ticket in order to pay a debt or do business. However, the person buying 
the ticket is in error in assuming that the ticket 141,592 has a better chance 
than the ticket 555,555. The two outcomes are equiprobable (or be^, 
equalN improbabkt, and in case Ihe reader is fainiliar »ilh the decimal 
expansion of e. he viill agree that 141.592 is even more special, ot more 
peculiar, than 555.555. Had »e considered 100 digiB rather than 6 digits, 
tire probabilities .ould have been negligible from the supercosmtc persf^ 
live and vve can assert that «e will never obtain a number consisting of IW 
conlecutive fives or the first 100 digits of the 
one acluallv performs the dra.ings. the num^ 

a priori as inlikelv as any other lOthdigit number; this is no paradox, smce 
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the number was not specified in advance and some number has to be obtained. 
Similarly, if one randomly selects 100 letters of the alphabet, one must 
obtain some particular sequence of 100 letters; what is improbable is that 
the 100 letters will constitute a phrase which is meaningful in a gisen 
language. It is similarly improbable that the 100 digits obtained belong to 
those sequences which are considered “special” by mathematicians. 

If, then, to choose a number on the internal from 0 to 1, one actually 
draws a great number of digits, one can be certain that the digits thus ob¬ 
tained will not follow any given rule that is either obvious, as in the case of 
a repeating decimal, or hidden, as in the case of decimal e.xpansion of e or 
77. There is thus no chance that after one has made 1,000 drawings one will 
be led to believe that one has obtained (as nearly as piossible) a simple 
rational number or a knowm irrational number such as e, rr, \ 2. etc. In 
fact, a mathematician would be as bewildered by such a number as a reader 
would be by a sequence of 1,000 letters. It is extremely improbable that he 
would be able to indicate any property connected in any way with a mathe¬ 
matical theory concerning given numbers. 


11.6 An Infinite Number of Drawings 

The whole matter appears in an entirely different light if one considers an 
infinite number of drawings; it is no longer a question concerning the law 
of chance, as it is assumed that no obstacles are presented bv numbers w hich 
are too large to be accessible. 

We know, by virtue of the application of the law of large numbers to 
iiifimtely many repeated trials, that the relative frequency of each of the 
digits 0, 1,..., 9 has the limit 1/10, and that it approaches this limit very 
rapidly. It is clear, however, that instead of drawing from among the 10 
digits in the decimal notation one could also draw from among the 100 
combinations 00, 01, ..., 98, 99, or from among the 1,000 combinations 
obtained by taking three digits at a time. If follows that if one is given a 
set of n arbitrary digits, one can assert that no matter how large n might be 
the set has the same relative frequency as each of the 10" possible sets of n 
digits. Thus, a set of 1,000 consecutive fives is not impossible, but it has the 
^e relative frequency as each of the 10*«» sets of 1,000 digits. It would 
necessary' to point out to the reader that to be able to write 
these 10 sets of1,000 digits it would not even suffice to fill the universe w ith 

r^s of paper having the thickness of an atom, while using svmbols which 
themselv^^ as small as an atom. This would accomodate only a small por- 
uon ofallthce numbers. We are thus dealing with theoretical ;pectulat.^s 

concerning infinite^ometricai 
figures, imaginary figures, or multidimensional spaces 

'f •>>* relnlirc frequence for 
each of the 10 digits tends to I/IO, and as absoluiely nornwl HMrespeci ,o 
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11.7 Generalization of the Problem of Choice 

We have seen the difficulties posed by the problem of randomly choosing a 
number from a continuous set. The difficulties are even greater if one slants 
to choose infinitely many numbers and their possible combinations. For the 
study of problems of this kind, some of which are related to the axiom of 
choice of Zermelo, 1 must refer to some of my other writings, mainly to my 
Elements of the Theory of Sets. Indeed, the theory of continuous probabilities 
has many analogies with the theory of the measure of sets, and in the study 
of certain questions one can sometimes employ the language of measure and 
sometimes the language of probability. Let me refer again to the book 
mentioned above, especially to Note II on probabilities in denumerable sets. 
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CHAPTER TWELVE 


The Discrete Case 


12.1 Some Simple Problems 

PROBLEM 38. Um A contains S white balls and I black ball, and um B con¬ 
tains I white ball and 3 black balls, if one ball is dran n from one of the urns, 
chosen at random, and it is observed to be white, what is the probabilitv that 
it ctane from um A ? 


Since the two urns contain the same number of balls, the probabilitv of 
drawing any one of the eight balls is the same. There are 3 white balls in A, 
1 in B, and the probability that the white ball came from A is. therefore, 3 4; 
this is the required probability. 

One might also argue as follows: Imagine 40 pairs of urns .4„ A^ __ 

^ 1 * • • •» where each um A, has the same composition as urn A 

and each urn has the same composition as urn B. Suppose that one urn 
b randomly selected from each pair and that a ball is drawn from each of 
these urns. If 20 of the urns chosen have the composition of urn A and 20 
have the composition of um B, the most likely result is that the former will 
yield 15 white balls and 5 black balls while the latter will yield 5 white balls 
and IS black balls. One thus obtains the following: 

20 drawings from A yield 15 white balb and 5 black balb 

20 drawings from B yield 5 white balls and 15 balb 

40 drawings yield 20 white balb and 20 Mack balb 


**K^"|.**?. ***' 20 drawings furnishing the 

white bails 15 came from urns like A and 5 came from urns like B; hence. 
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12.2 The Formula of Bayes 

Consider n urns among which /ij, called urns have the same proportion 
/>! of white balls, n^, called urns have the same proportion />, of >Ahiie 
balls,..., and w*, called urns have the same proportion /»* of \shite 

balls. The n = — ... — urns are otherwise identical, and one 

ball is drawn at random from one of these urns. If the ball drawn is white, 
what is the probability that it came from one of the urns .-I, ? 

Suppose that each urn contains N balls, so that each of the urns •<, con¬ 
tains piN white balls,* each of the urns A 2 contains p,\ w hite balls, etc. 
It follow's that the total number of white balls is 

— ... — n^Pt^V 

among which ftiPiN come from one of the urns Ai. Hence, the probability 
that the white ball came from one of the urns Ai is 

p ^ _ ”iPi^ ___«i£i_ 

ftiPiN -r . - . -f n,p, -!- n^p* -t- . . . — n^p^. 

Letting = Uj/i, — w^n ,..., and n* = h^/i, this formula can also be 
written as 


= - P2!h - 

p,Hi 4- p.w, -f- . .. -!- Pt^’t 

where the significance of the numbers is easily explained. 

If one randomly selects one of the urns, the probability of obtaining one 
of the urns Ai is h-j, since is the ratio of (the number of urns .4,) to 
#i (the total number of urns). One might thus call h\ the a priori probabilitv 
(that is, prior to the drawing) of choosing an urn of the kind A^\ the prob¬ 
ability Pi we have calculated is correspondingly called the a posteriori 
probability, and the formula for P, is called the formula of Saves. It can be 
derived by using the following argument, which makes use of the theorems 
for total and composite probabilities. 

The probability that the ball drawn is white and that it comes from 
can be evaluated in two ways, depending on the order in which one satisfies 
the two conditions that (1) the ball is white, and (2) it comes from 4,. 

(1) The probability that the ball is white is a total probability since the 
white ball can come from one of the urns /!„ from one of the urns A^. etc.; 
hence, the value of this probability is 

-h ... A- PtH* 

On the other hand, the probability that a ball comes from Ai given that it 
IS white was denoted P„ and it follows that the required probability is 

- - . -t- p^w^) 

assumed that the number Af is chosen so that ^.A,.... and are UI 



LRETt C 


^ n4 the DrSCRifTE 124 .HE DBEn,ETt ^ 


124 THE DlH 



44 “i 44 ”i 4 


fti&faJ^^^Wfififtlrtl»tal^^«ifft|lisfiitaiifal|^#«tMi^ 


of t/ie urn ‘: o# |4f urn 4 tjf the urn ^ 

hrtjrt^fe ttAffiiiwi»ilwi rtT«riit# ^4hmtjniM:*«l hiT«|iii;^6ttia«ii^ft<«i 

EpIajfeiJtiwIn^trtWf fw^pttffhl^iMttrtpTOftwfwIaiafBuiiitll^fctiifflfh^fc 
‘^#Ayli«WftiwMilteir*ieri:iiidismifcl»M.^wtr»lerfi^iWfaiUBMiw»tr 
ving special erfsehe following special olsclie following special crfschr folios 
p* Aimliifia^|t|i#iiis^i#: ftl^ltiTal^ilti^ippiift ®|^lllii»^it|i«iipp 

jsslbilitics: the sesen possibilities: the seven possibilities: the sescn pi 

biacfe: Aj^o ®^W white and 0 blacS: a^n# white and O black =rtwrbitc and O 

/5b 25 /Sb 25 /5b M 

black agi 1 black api 1 black pg, =^^|itt|pd i 

/4b 16 /4b 16 /4f 16 

black a blank wpa —E^^|itepid a black p^s -^^pitc^d 2 

.._, lA '> _ ._0Y_1 


OV » 



SEC. 12.2 


the formula of BAYES 125 


Designating the a priori probabilities of these seven cases as M j, h*. 

and We, we find that their a posteriori probabilities Pq, Pi, ■ and P^ are 
given by 

„ _____ 

® 36wo + 25wi + 16wj + 9w, -|- 4 h' 4 ^ Wj 

25 W|__ 

p — -——i--- 

‘ 36w„ + 25wi + 16w, + 9w, + 4 h’4 + h-s 


36wo + 25tv, + 16w* + 9 h-, + 4h-4 + 

P. = 0 

It is apparent from these formulas that we can specify H j,, M j, ..., and m-j in 
such a way that Pq, Pj,..., and Pj will assume any arbitrary set of values 
whose sum equals 1. (Pe must equal zero, for if one draws white balls it is 
certain that the balls cannot all be black.) 

To complete the solution of the problem, one must thus make some 
assumption concerning the values of h-q, h*j, ... , and w,. One plausible 
possibility is to assume that the a priori probabilities are all equal, namely, 
that one randomly selects one of seven urns having, respectively, the seven 
possible compositions. This would give 


p _ 3-5. p _ 

-To = 


P, = ^ 


•1, 


p _ -5- 




p, = o 


Another possibility is to assume that the urn has been filled at random 
by means of successive drawings (with replacement) from an urn containing 
the same number of white balls as black balls. By virtue of Pascal's triangle, 
one obtains 


Wq - H’j — 44, 

and, hence. 


»*’l = »*'5 = 6®4, 


= 1^4 = Jl, 


u- _ »# 

”3 — *4 


Po = 


6 

112» 




— * 5 . 

- 112 , 


— 40 

— 112 , 


P» = 


44 

112 , 


P* = i¥2, Ps = j\l. 


P. = 0 

It can be seen that this result differs considerably from the preceding one. 
The most probable alternative is now that the urn contains 4 white balls and 
2 black balls, since the a priori probability of this alternative is much greater 
than that of the alternative that the balls are all white. 

Let us suppose now that r = 5 = 1,000, namely, that 2,000 drawings have 
yielded as many white balls as black baUs, and let us look for the proba¬ 
bility that the proportion of white balls in the urn is p, (The proportion of 
black balls is correspondingly q = \ - p.) U p is the true proportion, the 
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expected number of while balls is 2.000/j and the observed deviation is 

2,000/? — 1,000 = 1,000(2/? — 1) 

Since the unit deviation is 

V4,(XK)/?9 = 20\ \0pq 

the relative deviation / is 


. _ l,000(2p- 1) _ (2p- 1) 

lQsJ\^pq ,,,/lOpg 

and it is known that the probability of ). exceeding 5 is very small. Such a 
large value of A can thus be ignored, unless the corresponding a priori 
probability is very high. Letting p = ^ -f a and ^ J - a, and ignoring 
terms in a^, one obtains 


A = 


200a 

Vio 


= 20ayT0 


and it can be seen that A exceeds VlO when a exceeds 1/20. Let us now 
suppose that the a priori probability of a assuming a value between —1/20 
and 1/20 is proportional to the length of the corresponding interval. Thus, 
the probability that one observes a relative deviation A between 20av'^ 
and AOas/lO -f dX, given that a is contained between a and a — dx, is 


-4 dX 

yjTT 

The probability that a is contained between a and x -- dx \% thus 


1 —4, 


e-4.ooa*- 

Jtt 



ri/20 

e-4 000*- 

yfjr • 

/-1/20 


= V4,000c‘^ ®®®** dx 


where the limits of integration in the denominator can be put equal to — oo 
and -1-00 without introducing an appreciable error. 

The probability that the proportion of white balls is contained between 
0.500 and 0.501 is obtained by substituting a = 0 and dx = 0.001, which 


gives 


^/ 4;000 ^ 2^10 
1,000 ~ 100 


Similarly, the probability that the proportion is contained between 0.510 
Ld 0.511 is obtained by substituting a = 0.01 and dx = 0.001, which gives 


V4,000 


1,000 
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Finally, the probability that the proportion is contained between 0.525 
and 0.526 is obtained by substituting a = 1/40 and dx = 0.001, which gives 

V 4:000 

1,000 

and this is about 10 times smaller than the preceding value. 

(2) Let us now suppose that r = 10, 5 = 0, and that the urn has been 
filled by means of 800 random drawings from another urn having as many 
white balls as black balls. The a priori probability for the urn to contain 
400 + h white balls is 

since the unit deviation equals 20. In order to determine the most probable 
composition of the urn, let us observe that the probability of getting 10 
white balls in 10 drawings is 

/ 400 -f h 

I 800 / 


so that we must find the maximum value of the product 

^ g-A*/4oo /400_2r_fi \ ® 

40 ^/^; \ 800 / 


Except for a constant, the logarithm of this quantity is 




400 


and equating to zero its derivative with respect to A, we obtain 


10 

400 + h 



/i* + 400fi — 2,000 = 0 


^ = —200 -h V'^42,000 = 200[—1 -}- V1 -f- 1/20] = 5 (approximately) 


Hence, the most probable composition is 405 white balls and 395 black 
balls, and it should be observed that drawing 10 white balls in a row docs 
not add much to the likelihood of a greater difference in the composition of 
the urn. In other words, owing to the manner in which the urn was filled, 
it is very improbable that the number of white balls in the urn is much 
greater than the number of black balls. 

The situation would have been different if 100 consecutive drawings 
(with replacement) had all yielded white balls. If this actually happened, 
one should first investigate whether there is not some error in the experi¬ 
ment; such a result is extremely unlikely. If the results are acceptable, one 
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has to mxximize the expression 


and this gives 


100 - log (400 ~ h)- — 

400 


= h*-400.- 20,000 = 0 


A — —2(X) — \ 60,000 = 44 (approximately) 


12.3 Applications 

There are situations where the evaluation of the a priori probabilities is 
simply a matter of estimation and the results will, of course, depend on the 
values of the estimates. The following is a classical example. 

PROBLEM 43. Peter plays Ecarte n ith a stranger who turns up a king the 
first time he has the deal. What is the probability that the stranger is a 
professional cheat? 


In the solution which Poincare gave this problem he assumed implicitly 
that a professional cheat would turn up a king ev’ery time he has the chance. 
.\lthough I have no personal experience in this matter (and neither did 
Poincare), it would seem to me that this assumption is not very likely. An 
honest plaver turns up a king on the average 1 ,'8 of the time, and it would 
seem that someone who has such “good luck" each time w'ould rapidly 
cause suspicion. It would seem more reasonable to estimate the probability 
that a cheat will turn up a king, say, as 1/4, whereas for an honest player 
this same probabilitv is I '8. 

Let us designate bv w the a priori probability that Peter s adversary is a 
cheat. The a priori probabihty that he is honest is thus 1 w, and the 
Q posteriori probabihtv P that he is a cheat (after he has turned up a king) is 

_Jw__ 2 w 

“ Ah — K1 — »*) > ** 


If H- = 1 2, namelv, if there is a fifty-fifty chance that Peter’s adversary fe 
a cheat, one obtains> = 2 3, and thb shows that the single observation h^ 
modified the probability to quite an extent. However, it should also be 
noted that it would be vwy imprudent of Peter to play with an oppo^t of 
whose moralitv he has such a low opinion. The fact ^t Peter » 
plav suggests that he regards h as being very small. If w b smaU, P di^ 
ver^ little from 2h, and in any event it is less than 2ir, ttot is, the P«> " 

bilitv is less than doubled by the observ«l event. If 
that H = 1/100.000, then P = 1/50,000. that is, P B 

as w. If Peter is sure of his partner and assumes that ir - 0, then P b also 
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ax rondel and one of them has served longer Thai the other 
probability that he has served for three teas? 


ewer. 12 
wAor is ifcr 


Lei a be the number of soldiers who have served for one vear. 6 the number 
Cl soldiers vvho have served for two vears, and c the number of soldieis who 
ha\e >er\ed for three \ears. Given one sokiier wtK> has served two 
and ar.oiber soldier sekcied at random, the probabOitv that the' first 
soldier has served longer than the second is 


_ a _ 

a — b — c — \ 

since the second soldier is one ofibea — * — c—1 who remain after the 
first soldier has been selected, and since among these there are a favorabte 
cases. Similariv\ the probabilitv that a soldier who has served for three 
vears is older than a second sokiier selected at random is 

a-b ' 
a — b — c — \ 


On the other hand, the a priori probabilities of sdectins a sokiier who ha^ 
served two years or three years are. respectiveiv.* 


a 



and 



and the a posteriori probability that the sokiier wiio has served kxmer 
has serv ed for three years is 

c a — b 

_ a — b — c a — b — c — 1_ 

c j — b b _ a _ 

a — b — c a — b — c — 1 a — b — c a — b c — 1 


nameiv 


da — fr> 
da — bi — ab 


_ah_ 

be — ca — ab 


This probabilitv can also be written in the fwrn 

1 _ 1 

a b 

I _1 _1 

a b c 


• » ae need lo aMisider the case where she first soUiff hassoved 

since this wouid make « E3f»ss»bie for ten lo have served locker than the other 
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If a — h — c, the probability is 2 3; if t» «» 2A 4<. that is. if each year 
has half as many as the preceding year, the probability is 

i 4- I ^ 3 
i f i 4- I 7 

which is less than 1/2. In this case it would be adsaniageous to bet that the 
soldier who has served longer has served two years, since there are twice as 
many of those as there are soldiers who have served three years, and twice 
as many who have served one year as there are soldiers who have served 
two years. 
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How does one account for these extensive fluctuations ? The first explana¬ 
tion that comes to one’s mind is that the population on which the data 
are based is not homogeneous; evidently, mortality statistics depend on age, 
sex, social conditions, standard of living, and so forth. However, we know 
(from Sections 5.5, 5.6, and 5.7) that the consideration of several distinct 
groups which one regards as one population will tend to diminish. ne\er 
increase, the relative deviation. One must thus look for another explanation. 
An analysis of the conditions suggests that the number of deaths is trecjuently 
increased by epidemics, periods of e.xtreme heat or severe cold. etc. Such 
phenomena can account for a great number of simultaneous deaths which 
cannot be regarded as independent. Looking at things from a purelx ab¬ 
stract point of view, there seem to be ties among the balls, so that one 
cannot draw one ball from an urn without also drawing others. This would 
justify the e.xistence of greater deviations. Suppose, for the sake of argument, 
that an urn contains 40,000,000 balls of which 800,000 are black, and that 
40 million persons are asked to draw one ball each with replacement. The 
expected number of black balls would be 800,000 and the unit deviation 
would be 


yflnpq = 


1,000 • 7v 80 

50 


1,260 (approximately) 


since n = 40,000,000, p = \ /50, and q = 49/50. 

Suppose now that one makes only 40,000 drawings, but that each drawing 
of a black ball is considered worth 1,000 (black balls). The expiected number 
of black balls drawn would be 800 and, in accordance with the convention, 
this would represent 800,000 black balls. The unit deviation w ould be 

y80,000 • ^ ^ ^ ^ = 40 (approximately) 


where, in accordance with the convention, each unit corresponds to 1,000 
black balls. The unit deviation would thus be equivalent to 40,000 and, 
hence, a deviation of 40,000 from 800,000 would be looked upon as quite 
probable. It can thus be seen that this hypothesis can account for con¬ 
siderable deviations. Let us now leave the problem of mortality statistics to 
which we have been led by the preceding considerations, and let us examine 
briefly how one might pose the general problem of statistical probability. 


13.6 The Model of the Ums 

In order to avoid unnecessary verbal compUcations, let us consider series of 
observations made from the same number of possible cases, for example, 
from a constant population. For each series one records the number of 
times a cer^n kind of event, called a success, has occurred. This terminoloev 
is conventional, and the term “success” may apply to a birth, a death, a 
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since the probability for tails is r and the probability for heads is I - r. 
Preserving the notation of Section 12.2, the numbers m* (the a priori prob¬ 
abilities) are equal to dz^ and the numbers are given by expression (1). 
Applying the formula of Bayes, namely, 

p = P**** 

we cancel the factor independent of z which is common to all of the snd 
we obtain 

z-(\-zYdz 

C' 

J^z'd - zydz 

This is the a posteriori probability for .r to be contained between z and 
z -f- </z, namely, the probability after the observations have been made. No 
matter what assumption one makes about .r, the probability P is an infini¬ 
tesimal of the same order as dz. In other words, the probability for any 
particular value of x is infinitesimally small, and we w ill have to consider a 
finite interval, so that the probability that x is contained in the interval is, 
itself, finite. 


14.2 Special Cases 

PROBLEM 50. J/in the game defined in the preceding section two trials yielded 
two tailSy what odds should one give that this result is due to the fact that for 
the given polyhedron the probability of getting tails exceeds I \2? 


Substituting p = 2 and n = 0 into the formula for P, we obtain the 
preliminary result that 



= 3z* dz 


To obtain the probability that x exceeds 1/2, we have only to integrate 
between 1/2 and 1, and this gives 


3z*dz = ^ 

Jt/t 8 

Thus, the probability that for the given polyhedron the probability of 
getting tails exceeds 1/2 is 7/8, and the appropriate odds are 7 to 1. 


PROBLEM 5\. If n andp are large in Problem 49 and one lets t = ^ ^ 

what is the probability that x is contained between t - v and t ~ v where v 
is very small? ' ’ ^ 
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^ noted that the given value of t maximizes the product 
“ ' *' ^nd, hence, the element of probability P on page 141. This 

result is obtained by equating to zero the logarithmic derivative of the 
product; that is, one gets 


and, hence. 



- — ^ ~ ^ _ 1 

P n n + p 
In order to evaluate the integral 

Jp.n =//”(! - 2 )" dz 

we use the formula 



- 2 )-] = (p - 1 ) 2‘’(1 - z)" - nz^\l - 


z)" ^ 


which, integrated between 0 and 1, gives 

® ~ (P tHrl.n—\ 


since the integral of the left-hand member is zero when p -f 1 and n are 
positive. This last result can also be written as 




n 


and it follows that 
n 


P 1 


1 


Jp.n = 


n — \ n — 2 


1 


Since 


1 p —2 p -r 3 n -I- p — 1 n -f p 

1 


' «+p.o 


one finally obtains 


'. 


•^^dz = 


n -f p -f 1 


n\p\ 


(n -f p -f 1)! 


and the cleinent of prc 

( 1 ) ' 

If we now let 2 = / 
value approached by It 
Section 5.1. Omitting a 

(2) 


ty P can be written as 
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n!p! 


dz 


where y is very small, we can easily obtain the 
by means of calculations analogous to those of 
;tail, let us merely indicate the result; letting 
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we get, as a first approximation, the following value ot P: 

r, 1 -A* J* 

(3) P = — e d/. 

N ^ 

This is the formula of Laplace-Gauss, with the unit deviation defined b> (2). 
The required probability is thus given by 6(x), with /. determined by y in 
accordance with relation (2). 

No matter how small y might be, the value of /. defined by (2) increases 
when n and p are increased in such a way that their ratio remains unchanged. 
Hence, the probability that x is contained between i — i and t — y apv- 
proaches unity. It follows that if the number of trials is very large, one can 
(as would seem reasonable) consider the result as furnishing a very precise 
indication of the unknow n probability. 

Let us givea numerical e.xample, remembering, of course, that the preceding 
results are based on the assumption of equal prior probabilities. Suppvose 
one has 

p — 800 and /i = 100 

For these values formula (2) yields / = 67.5y and, letting y = 0.04. it can 
be seen that the value of ^(2.7) differs from unity by a little more than 

P 8 

0.0001; in fact, it equals 0.99986 .... Moreover, - — - = 0.888 . . . , 

n — p 9 

and one can conclude with a probability of at least 0.99986 that the 
probability of tails is contained between 0.849 and 0.929. 

Had we used the values p = 80,000 and n = 10,000, we would have 
obtained ). ^ 675y and hence, putting y = 0.004 (w hich is ten times smaller 
than before), we would have arrived at the same value of 6(7.). It follows 
that the probability for tails is now contained between 0.885 and 0.893, and 
it can be seen that by increasing the number of trials one narrow s the limits. 
It follows from the preceding calculations that the difference between the 
limits is inversely proportional to the square of the number of trials. If one 
defines precision as inversely proportional to the difference between such 
limits, one can say that the precision is proportional to the square root of the 
number of trials (see Section 10.5). 

14.3 The True Value of a Measured Quantity 

When one has observed various measurements of one and the same quantitv. 
the problem of determining its actual value is a problem of the probability 
of causes, since the actual value is evidently the principal cause which 
determines the values of the measurements. Given a certain measurement, 
we could use the results of Chapter 10 and evaluate the probability of ob¬ 
taining that measurement, assuming that the quantity being measured has 
some arbitrary value. We could then obtain the a posteriori probability. 
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a author s language and style. Each writer has special habits concerning 
the length of words and phrases, which can easily be analyzed numericallv 
Such matters can be studied by means of interpolations from the texts, but 

questions of metric can also be phrased in statistical terms and solved with 
statistical techniques. 

The methods of the theory of probability have also been applied to ques¬ 
tions of art. For instance, various schools of sculpture may be characterized 
by the sizes of certain ratios among the parts of the human body, and 
numerical deviations from such ratios may make it possible to differentiate 
between the different schools. 

In all these questions the role of mathematics is mainly that of furnishing 
the assumptions which must be made in order to subject the problem posed 
to an appropriate analysis. In spite of their interest, we shall leave these 
questions and others aside, and limit ourselves to certain details concerning 
applications that are more truly scientific; that is, we shall limit ourselves 
to those sciences which have been considered as such for a long time. 


15.2 The Distribution of Stars on the Celestial Sphere 

This is one of the most important questions of natural philosophy, as it 
touches upon the origin and the destiny of our universe. Except for some 
special cases, we shall ignore the distances between the stars, and this pre¬ 
sents the first difficulty: If two or more stars are drawing together in the 
sky, can one conclude that they are also drawing closer together in space? 
There is no question here of giving a rigorous answer, but the study of proba¬ 
bility can, in certain cases, give a very strong indication in favor of the 
affirmative. Unless one allows for regular arrangements analogous to those 
found in crystal structure (that is, arrangements whose existence one has no 
reason to suspect), it is clear that if two or more stars are extremely distant 
in space, the probabilities that their positions on the celestial sphere are 
approaching one another can be calculated in accordance with the princi¬ 
ples of Section 8.6. Then if one counts only stars of a certain size, for 
example those visible to the naked eye or with a telescope, one could cal¬ 
culate the probability that a given number of stars are situated inside a 
small circle on the celestial sphere. If there actually are groupings for which 
the probability thus calculated is very small, we would be justified in thinking 
that the grouping has a cause other than chance, namely, that the grouping 
consists of stars that are actually approaching each other in space. 

Although they do not seem convincing, Bertrand has raised the following 
objections to this way of thinking: The Pleiades [he said] seem to be closer 
to one another than would be natural. This assertion is worthy of interest, but 
if one wants to translate the consequence into figures, one runs into difficulties. 
In order to make this “ drawing closer" precise, must one look for the smallest 
circle containing the group? The greatest angular distance? The sum oj the 
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squares of all the distances? The area of the spherical polygon nhosc vertices 
are some of these stars while the others are in its interior ? So Jar as the 
Pleiades group is concerned, these quantities are all much smaller than should he 
expected, so which one gives the appropriate measure of unlikelihood^ IJ three 
of the stars happened to form an equilateral triangle, must one include this 
{which surely has a very small a priori probability) among the circumstances 
revealing a cause ? 

What has to be translated into figures is the probability that a grouping 
is random and, no matter what Bertrand might say, the result will depend 
only to a slight degree on the particular form given to the definition of a 
grouping: a very small circle containing the stars, a spherical com ex polygon 
and so forth. Practically speaking, it would seem reasonable to choose the 
definition which leads to the simplest calculations. Such calculations would 
lead to a result such as the following: Chance might yield such a grouping 
about once in 23,000. The moment this result has been stated, the theory 
of probability has done all it can and it must give way to induction and to 
the hypothesis one ultimately tries to verify (if possible) by other means. 

Let us also say a word about Bertrand's idea concerning the possibility 
that an equilateral triangle might be formed by three stars; it reduces to a 
question of rounding numbers. If one considers a number selected at random 
from between 1,000,000 and 2,000,000, the probability tha* it equals 1,342,517 
is one in a million, and the probability that it equals 1,. 00,000 is also one 
in a million. Nevertheless one would probably be inclined to regard the 
last one less probable than the first; this is due to the lact that one never 
considers a number such as 1,542,317 individually, but as a typ>c of number, 
that is, numbers which are all similar in appearance. If one transcribed the 
number and changed some of the digits, one would have difficulty distin¬ 
guishing, say, between l,324,5I9and 1,324,517. The reader will have to make 
an effort to be sure that the four numbers given in the p-eceding discussion 
are all different. 

When one has observed a number such as the preceding one in the 
evaluation of an angle to the nearest tenth of a second, one would not think 
of asking for the probability that this angle is exactly 13'’42'5I.7'. Neverthe¬ 
less, the angle must have some value, and no matter what this value might 
be to the nearest tenth of a second, one can only say that a particular 
me^urement has an a priori probability of 1 in 10 million and, hence, that 
it is extremely unlikely. This is a sophism of the kind that tainted the 
analysis of scriptures which was popular in the nineteenth century. 

The question is to know whether the same reservations apply also to the 
case where one of the angles of a triangle formed by three stars has an 
unusual value, say, where it equals 60 degrees (an angle in an equilateral 
tnangle) or 45 degrees (half of a right angle). What one must say about 
this subject is that one must be very suspicious of the tendency to call 
something unusual unless it has been specified in advance; the number of 
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this book we shall merely indicate two of the most important ones, those 
dealing with the homogeneity of populations and those dealing ssuh the 
subject of correlation. 


Suppose one measures the waist of a great number of French adults. If 
one looks upon the mean of these measurements as the true value of the 
waist of a Frenchman, one finds that the “errors.” namely, the positise and 
negatise differences between this theoretical value and the obsersed \alues. 
distribute themselves in accordance with the law of Laplace. The situation 
appears as if all Frenchmen had exactly the same size waist, equal to the 
mean, and that they w ere being measured by a very incompetent experimenter, 
so that the errors of measurement follow the law of Laplace. Using the 
language introduced in Section 10.3, we could say that the waists of French¬ 
men constitute a normal population. Biologically, this is in accordance with 
the fact that the French people constitute a sufficiently homogeneous 
biological group. 

Suppose now' that one performs the same experiment in a town where 
the inhabitants belong to two different races—say, some of them are Cau¬ 
casians and some of them are orientals.’If one measured all the adult 
inhabitants and recorded the measurements without recording the race, one 
would find that the mean no longer has the same significance and that the 
distribution of the values about the mean is not as simple as the one given 
bv the law of Laplace. One obtains the superposition of two distinct pheno¬ 
mena, each follow ing the law of Laplace, which together do not follow this 
law. The waists of the first subpopulation are grouped about their mean in 
accordance w ith the law of Laplace, and those of the second subpopulation 


are similarly distributed about their mean. The over-all mean has no real 
significance; it may very well be that it does not even represent the value 
having the highest frequency. This expresses the biological fact that there 
are two distinct subpopulations, and had this not been known it would 
have been revealed by an analysis of the distribution and its mean. 

Without delving deeper into this subject (to which one could devote a 
w hole book), it should be noted that the experimental study of analogous 
phenomena leads to the following general conclusion: The biometric charac¬ 
teristic intrinsic to normal series is the purity of the corresponding populations 
We shall not go into the detailed discussion that would be required to justify 
such a eeneral statement; it is up to the biologist rather than the mathe¬ 
matician to make these ideas more precise. Be this as it may, the t eory 
has proven to be of great scientific value and useful in many pra^'ca' 
applications. Let us mention just one instance, the 

irsvveden « ith dilTerent varieties of barley. Applying the statistical concept 
of the purity of populations they obtained an extremely pure strain of barley 
whose constancv Ls very much appreciated by the brewers by whom the 
scientists w ere employed. In a situation like this the question of homogeneity 


is of great industrial importance. 
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This is precisely what amateurs at roulette tr\ to do; they hope to dis¬ 
cover a law controlling successive results, but they fail because a roulette 
wheel does not obey any laws except those of chance. The situation Mould 
be different if an intelligent player accounted for all previous plass, sometimes 
choosing even in preference to odd (or vice versa) in accordance Mith the 
observations he has made in the preceding trials, especially Mith reference to 
the nature, the number, and the arrangement of runs. HoMcver, the moment 
an intelligent opponent discovers the law that is used to imitate chance, 
he could adjust his play accordingly and, thus, assure himself of a definite 
advantage. The player who wants to imitate chance is thus at a disadvan¬ 
tage, and this would not have happened if he actually made phvsical or 
mental drawings to determine his play.* 

1.9 Application to Parlor Games 

In most card games which are called parlor games (bridge, poker, etc.), the 
interest in the game arises from the fact that it depends on chance as well as 
on the competence of the players. It is easy to see, however, that in most 
of these games a player who follows immutable rules, that is, a player w ho 
always plays the same way under the same conditions would find himself at 
a disadvantage; his oppionent might discover his mode of play and modify 
his strategy accordingly. 

These games are too complicated to develop for them a complete mathe¬ 
matical theory (which moreover would remove a good deal of their interest), 
but it should be noted that such a theory' would have to lead to fixed proba¬ 
bilities, that is, it would have to indicate for each situation how a player 
must choose between two or more alternatives with fixed probabilities. 

In fact, if a poker player always played the same way in a given situation, 
his opponents would be able to deduce information from his mode of play 
(without much chance of being deceived), and they could use this informa¬ 
tion to their advantage. Consequently, if one could establish a mathematical 
theory for correctly playing poker (which actually is too complicated a 
problem to be solved completely), the rules of such a theory w ould ultimatelv 
have to be formulated in terms of probabilities such as the following, per¬ 
taining to a situation which might arise during the course of plav: After his 
opponents have made certain known moves, a player with a given hand 
must drop out with the probability p’, pass with the probability p’. or raise 

an amount #•„ Tj, ..., or r, with respective probability of />,. />j.or 

Some of these probabilities may be zero, but their sum must alwavs equal I. 
Since a player cannot very well interrupt a game to make random drawings, 
he will have to be satisfied with mental drawings (see footnote), in which he 

• If one not have at one’s disposal material objects to perform random drawings. 
^ n^ht think of words or phrases, count the letters, and thus choose between even and 
OGG. This might be referred to as making mental drawings. 
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Although Poincare did not s\stematically use this notation of Ke^'nes 
(or an analogous notation), he did not ignore the fact that the probability 
of an e\ent can be different for two observers who do not have the same 
information concerning the circumstances surrounding the event. This is 
precisely w hat he meant w hen be said that certain probabilities are subjective, 
namely, that their values are exact only for a certain person. Nevenheless, 
he maintained that there are certain probabilities which have the same 
value for a great many people, since the body of knowledge referred to as 
A' bv K.e\ nes is practically the same for each. These are the probabilities 
which Poincare called objective and this terminology is correct, clear,.and 
appropriate, even though it might be contested by a metaphysician in whose 
eyes no human knowledge can be called objective. 

Considering the roll of a pair of dice, for instance, if the onlookers know 
that the dice are properly fabricated and that they are rolled with sufiicient 
force so that they do not immediately come to rest, the probability of each 
face is the same, equal to 1 6. and it is an objective |M'obability. 

The situation is the same if one considers the probability^ of being dealt 
a given hand in a card game (say, bridge or poker), provided the cards are 
thoroughly shuffled and dealt by an honest player. For all players or spec¬ 
tators. the body of knowledge K is the same: they all know the composition 
of the deck and the fact that the pierson w ho deals the cards is not dishonest. 
Thus, we conclude with Poincare that it b legitimate to speak of objective 
probabilities in certain situations, but this will not stop us from using the 
notation of Kev nes to study subjective probabilities. 


11.2 The Probability of Death 

Among the objective probabilities cited by Poincare are those used by 
insurance companies in establishing their rates. The regularity with which 
these companies pav dividends to their stockholders supports the objectivity 
of their calculauom. This is a partkrulariy interesting example, since prob¬ 
abilities concerning human life may be regarded as objective or subjective, 
depending on one's point of view . 

Insurance companies use mortality tables, prepared in accordance w^ 
precise rules which we shall not discuss. It is to these mortahty tables that 

thev attribute an objective value on the grounds of experience. . 

Nevertheless, insurance companies take precautions to prev>ent the^ured 
from modiMng the odds bv introducing subjective ele^B mto 
sideral.ons. ln fact, it fe evident that if a man kno« 

(perhaps faul) disease, he uould be interested in tnsunng he life to ben^ 
his heirs Thus, if the insurance company did not insist that ass 
uke^ ^«l esamination before the policv is 

“f persons .ith insurance .ould be much higher than the rates given m the 
table. 
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APPENDIX THREE 


The Petersburg Paradox 


III.l The Petersburg Paradox 

The problem we shall discuss in this section is the Petersburg paradox, which 
dates back to the eighteenth century; it has given rise to numerous disputes 
among eminent specialists in the calculus of probability. 

PROBLEM. Peter and Paul play the game of heads or tails under the following 
conditions: If Paul wins the first n — I games and Peter wins the nth, Paul pays 
Peter 2* dollars and the game ends. How much should Peter pay Paul to play 
the game, assuming that the game is to be equitable? 


The answer to this problem is unexpected and improbable, as the name 
paradox might suggest. The amount Peter must pay is infinite, which means 
that no matter how much Peter might pay, the game is to his advantage. 

We knovfc that Peter’s total mathematical expectation is the sum of the 
mathematical expectations corresponding to the various possibilities. If he 
wins the nth game (after having lost the first /i - 1), his gain is 2", the pro^ 
ability of thrs happening is 12", and the mathematical expectation is 1. 
This argument holds regardless of the value of n, which may be as large as 
desired.'' Thus, the total mathematical expectation is 


This series extends indefinitely and E is infinite. 

One can give two different answers to this paradox, depending on '^'heAer 
one assumes a concrete or an abstract point of view. Accounting for concrete 
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IV. 1 I alues of the Function ©(/) 
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